Automatic Grammar Induction from Semantic Parsing

Authors

  • Debajit Ghosh
  • James R. Glass
  • David Goddeau
Abstract

In this thesis, we investigate an approach for grammar induction that relies on semantics to drive the automatic learning of syntax rules. Specifically, we develop a semantic parser, which parses utterances based on "meaning," rather than syntax, by combining only words and phrases that satisfy a given set of semantic constraints. We subsequently extract a syntactic grammar from the resulting semantic-level phrases, trying various approaches to generalize this grammar. We evaluate the learned grammar utilizing two sets of experiments, restricting our test sets to semantically valid utterances. First, we use the grammar to parse new utterances from the same domain in which it was learned; the learned grammar covers 98% of the utterances handled by the semantic constraints. Second, we parse utterances from a new domain, assessing the portability of the grammar. Here, the grammar covers 85% of the semantically valid utterances. Semantic parsing proves to be a very powerful and useful mechanism, independent of syntax, providing an utterance's "meaning representation" directly. Furthermore, our experiments illustrate that this technique has potential for automatically developing portable grammars, making the task of moving an understanding system to a new domain easier.

Company Supervisor: Dr. David Goddeau
Title: Research Staff, DIGITAL Cambridge Research Laboratory
Thesis Supervisor: Dr. James R. Glass
Title: Principal Research Scientist

Similar resources

Automatic grammar induction from semantic parsing

This research investigates using semantic information to learn syntax rules automatically. After describing a semantic parsing mechanism for parsing utterances based on meaning, we illustrate a grammar induction technique which uses semantic parsing’s results to create syntactic rules. We also present and discuss several experiments which use the learned grammar in syntactic parsing experiments...

Semi-automatic acquisition of domain-specific semantic structures

This paper describes a methodology for semi-automatic grammar induction from unannotated corpora belonging to a restricted domain. The grammar contains both semantic and syntactic structures, which are conducive to language understanding. Our work aims to reduce the reliance of grammar development on expert handcrafting or the availability of annotated corpora. To strive for a reasonab...

Automatic Semantic Role Labeling in Persian Sentences Using Dependency Trees

Automatically identifying the words that bear semantic roles (such as Agent, Patient, Source, etc.) in sentences and assigning the correct roles to them may improve many natural language processing tasks, including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...

Approach to Automatic Translation Template Acquisition Based on Unannotated Bilingual Grammar Induction

In this paper, we propose a new approach that automatically acquires translation templates from unannotated bilingual spoken-language corpora in the domain of travel information access. The approach adopts two basic algorithms: a grammar induction algorithm and a dynamic programming algorithm. Our approach is an unsupervised, statistical, data-driven method which avoids the...

Tiny Corpus Applications with Transformation-Based Error-Driven Learning: Evaluations of Automatic Grammar Induction and Partial Parsing of SaiSiyat

This paper reports preliminary results on automatic grammar induction based on the framework of Brill and Markus (1992) and on binary-branching syntactic parsing of Esperanto and SaiSiyat (a Formosan language). Automatic grammar induction requires a large corpus and is found to be impractical for processing endangered minority languages. Syntactic parsing, by contrast, needs only a tiny corpus and works alo...

Publication date: 2009